Empirical performance maximization for linear rank statistics
نویسندگان
چکیده
The ROC curve is known to be the golden standard for measuring performance of a test/scoring statistic regarding its capacity of discrimination between two populations in a wide variety of applications, ranging from anomaly detection in signal processing to information retrieval, through medical diagnosis. Most practical performance measures used in scoring applications such as the AUC, the local AUC, the p-norm push, the DCG and others, can be seen as summaries of the ROC curve. This paper highlights the fact that many of these empirical criteria can be expressed as (conditional) linear rank statistics. We investigate the properties of empirical maximizers of such performance criteria and provide preliminary results for the concentration properties of a novel class of random variables that we will call a linear rank process.
منابع مشابه
Estimation and empirical performance of non-scalar dynamic conditional correlation models
This paper presents a method capable of estimating richly parametrized versions of the dynamic conditional correlation (DCC) model that go beyond the standard scalar case. The algorithm is based on the maximization of a Gaussian quasi-likelihood using a Bregman-proximal trust-region method to handle the various non-linear stationarity and positivity constraints that arise in this context. We co...
متن کاملBayesian Learning for Low-Rank matrix reconstruction
We develop latent variable models for Bayesian learning based low-rank matrix completion and reconstruction from linear measurements. For under-determined systems, the developed methods are shown to reconstruct low-rank matrices when neither the rank nor the noise power is known a-priori. We derive relations between the latent variable models and several low-rank promoting penalty functions. Th...
متن کاملDetection of Outliers and Influential Observations in Linear Ridge Measurement Error Models with Stochastic Linear Restrictions
The aim of this paper is to propose some diagnostic methods in linear ridge measurement error models with stochastic linear restrictions using the corrected likelihood. Based on the bias-corrected estimation of model parameters, diagnostic measures are developed to identify outlying and influential observations. In addition, we derive the corrected score test statistic for outliers detection ba...
متن کاملMaximization of Empirical Shannon Information in Testing Significant Variables of Linear Model
Search for an unknown set A; Card(A) = s, of signiicant variables of a linear model with random IID discrete binary carriers and nitely supported IID noise is studied. Two statistics T 1 ; T s ; based on maximization of Shannon Information (SI) of the corresponding classes of joint empirical input-output distributions , are proposed inspired by the related study in Csiszar and KK orner (1981). ...
متن کاملGuarantees for Greedy Maximization of Non-submodular Functions with Applications
We investigate the performance of the GREEDY algorithm for cardinality constrained maximization of non-submodular nondecreasing set functions. While there are strong theoretical guarantees on the performance of GREEDY for maximizing submodular functions, there are few guarantees for non-submodular ones. However, GREEDY enjoys strong empirical performance for many important non-submodular functi...
متن کامل